Pairwise Coupling for Machine Recognition of Hand-Printed Japanese Characters
نویسندگان
چکیده
Machine recognition of hand-printed Japanese characters has been an area of great interest for many years. The major problem with this classification task is the huge number of different characters. Applying standard ”state-ofthe-art” techniques, such as the SVM, to multi-class problems of this kind imposes severe problems, both of a conceptual and a technical nature: (i) separating one class from all others may be an unnecessarily hard problem; (ii) solving these subproblems can impose unacceptably high computational costs. In this paper, a new approach to Japanese character recognition is presented that successfully overcomes these shortcomings. It is based on a pairwise coupling procedure for probabilistic two-class kernel classifiers. Experimental results for Hiragana recognition effectively demonstrate that our method attains an excellent level of prediction accuracy while imposing very low computational costs.
منابع مشابه
Recognition of Hand-Printed Characters via Induct-RDR
The goal of character recognition research is to simplify and automate the development of character recognition algorithms. We describe here an approach based on applying preprocessing to data sets of Latin characters and then applying a machine learning approach to the data sets to build a knowledge base able to classify unseen pre-processed characters. The machine learning method, Induct/RDR,...
متن کاملRecognition of Hand Printed Characters Based on Simple Geometric Features
Problem statement: The use of computers in information processing technology nowadays is one of the main trends of office automation. For more than four decades, information from the outside world is transferred into computers in a traditional way by keying in these raw data with the help of keyboard. Most of these data are in hand printed form and very large; therefore the use of automatic rec...
متن کاملSurvey of Pattern Recognition Approaches in Japanese Character Recognition
Optical Character Recognition (OCR) in Japanese, both handwritten and printed, is difficult to perform, owing to several reasons. Firstly, the Japanese language is comprised of over 3000 characters which can be classified as syllabic characters, or Kana, and ideographic characters, called Kanji. Secondly, Japanese text does not have delimiters like spaces, separating different words. Thirdly, s...
متن کاملBlob Detection Technique Using Image Processing for Identification of Machine Printed Characters
Optical character recognition systems have been effectively developed for the recognition of printed characters. Optical character recognition is an awesome computer vision technique with various applications ranging from saving real time scripts digitally and deriving context based intelligence using natural language processing from the texts. One such application is the recognition of machine...
متن کاملPrecise Hand-printed Character Recognition Using Elastic Models via Nonlinear Transformation
Distorted character recognition is a difficult but inevitable problem in hand-printed character recognition. In this paper, we propose a character recognition method using elastic models for recognizing cursive characters with intricate structure. The models are fitted to unknown input patterns by applying the EM algorithm to minimize a measure of fittness. To avoid falling into local minima, m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001